Yulia Tsvetkov

MentorCollab: Selective Large-to-Small Inference-Time Guidance for Efficient Reasoning

Feb 05, 2026

Among Us: Measuring and Mitigating Malicious Contributions in Model Collaboration Systems

Feb 05, 2026

The Single-Multi Evolution Loop for Self-Improving Model Collaboration Systems

Feb 05, 2026

Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch

Feb 03, 2026

BASS: Benchmarking Audio LMs for Musical Structure and Semantic Reasoning

Feb 03, 2026

MoCo: A One-Stop Shop for Model Collaboration Research

Jan 29, 2026

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Nov 10, 2025

Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)

Oct 27, 2025

PrefPalette: Personalized Preference Modeling with Latent Attributes

Jul 17, 2025

Spurious Rewards: Rethinking Training Signals in RLVR

Jun 12, 2025